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Foreword 



rd , 



This Technical Specification has been produced by the 3 Generation Partnership Project (3 GPP). 

The contents of the present document are subject to continuing work within the TSG and may change following formal 
TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an 
identifying change of release date and an increase in version number as follows: 

Version x.y.z 

where: 

x the first digit: 

1 presented to TSG for information; 

2 presented to TSG for approval; 

3 or greater indicates TSG approved document under change control. 

y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, 
updates, etc. 

z the third digit is incremented when editorial only changes have been incorporated in the specification; 

The 3GPP Multimedia messaging service (MMS) specification consists of three 3GPP TSs; 3GPP TS 22.140, 3GPP TS 
23.140 and the present document. The TS 3GPP TS 22.140 [22]provides a set of requirements which shall be supported 
for the provision of non real-time multimedia messaging service, seen primarily from the subscriber's and service 
providers' points of view. The TS 23.140 [23] identifies the functional capabilities and information flows needed to 
support the MMS. The present document provides the details of media types, formats and codecs used by the 
MMS service 

The issue of codecs ad for MMS services has been addressed initially in TS 23.140, owned by the 3GPP T2 group. 
During the TSG-T WG2 group meeting in Edinburgh in September 2001, the TSG-T WG2 group sent a Liaison 
statement (S4-AHP040) to the 3GPP SA WG4 group, requesting that the responsibility for the specification of codecs 
and formats to be used in MMS services is transferred to SA WG4 group starting with Release 5. 

After the S A WG4 group agreed to take over this responsibility, and the present document is the result of such 
commitment on Release 6. 

For the sake of interoperability and alignment it is important there is no contradiction between the recommendations 
made in the present document and in the 26.234 specification [14]. 
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Scope 



The present document specifies the media types, formats and codecs for the MMS within the 3GPP system. The scope 
of the present document extends to codecs for speech, audio, video, still images, bitmap graphics, and other media in 
general, as well as scene description, multimedia integration and synchronization schemes. 
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3 Definitions and abbreviations 

3.1 Definitions 

For the purposes of the present document, the following terms and definitions apply: 

continuous media: media with an inherent notion of time, in the present document speech, audio and video 

discrete media: media that itself does not contain an element of time, in the present document all media not defined as 
continuous media 

scene description: description of the spatial layout and temporal behaviour of a presentation, it can also contain 
hyperlinks 

3.2 Abbreviations 

For the purposes of the present document, the abbreviations given in 3GPP TR 21.905 [1] and the following apply: 

3GP 3GPP file format 

AAC Advanced Audio Coding 

AVC Advanced Video Coding 

CC/PP Composite Capability/Preference Profiles 

DIMS Dynamic and Interactive Multimedia Scene 

DLS Downloadable Sounds 

DRM Digital Rights Management 

Enhanced aacPlus MPEG-4 High Efficiency AAC plus MPEG-4 Parametric Stereo 

EXIF Exchangeable image file format 

GIF Graphics Interchange Format 

H.263 ITU-T video codec 

ITU-T International Telecommunications Union - Telecommunications 

JFIF JPEG File Interchange Format 

JPEG Joint Picture Expert Group 

MIDI Musical Instrument Digital Interface 

MIME Multipurpose Internet Mail Extensions 

MM Multimedia Message 

MMS Multimedia Messaging Service 

MPEG Motion Picture Expert Group 

MP4 MPEG-4 file format 

PIM Personal Information Manager 

PSS Packet-switched Streaming Service 

SBR Spectral Band Replication 

SP-MIDI Scalable Polyphony MIDI 

SVG Scalable Vector Graphics 

UTF-8 Unicode Transformation Format (the 8 -bit form) 

XMF Extensible Music Format 



Media formats 



Multiple media elements shall be combined into a composite single MM using MIME multipart format as defined in 
RFC 2046 [25]. The media type of a single MM element shall be identified by its appropriate MIME type whereas the 
media format shall be indicated by its appropriate MIME subtype. 

In order to guarantee a minimum support and compatibility between multimedia messaging capable terminals, MMS 
User Agent supporting specific media types shall comply with the following selection of media formats: 
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4.1 



Text 



Plain text. Any character encoding (charset) that contains a subset of the logical characters in Unicode [2] shall be used 
(e.g. US-ASCII [3], ISO-8859-1 [4], UTF-8 [5], ShiftJIS, etc.). 

Unrecognized subtypes of "text" shall be treated as subtype "plain" as long as the MIME implementation knows how to 
handle the charset. Any other unrecognized subtype and unrecognized charset shall be treated as 
"application/octet - stream". 

Interoperability with SMS text type is according to [23]. 



4.2 Speech 



If speech is supported, the AMR codec shall be supported for narrow-band speech [26] [40] [41] [42]. 

The AMR wideband speech codec [27] [43] [44] [45] shall be supported when wideband speech working at 16 kHz 
sampling frequency is supported. 

When using speech media type alone, AMR or AMR-WB data is stored according to the file format specified in [32]. 

Multi-channel sessions shall not be used. 

4.3 Audio 

If audio is supported, then one or both of the following two audio codecs should be supported: 

- Enhanced aacPlus [49] [50] [51] 

- Extended AMR-WB [46] [47] [48] 

There is no requirement that a terminal supporting decoding by one of the codecs shall also support encoding by that 
codec. 

Specifically, based on the audio codec selection test results Extended AMR-WB is strong for the scenarios marked with 
blue, Enhanced aacPlus is strong for the scenarios marked with orange, and both are strong for the scenarios marked 
with green colour in the table below: 



Content type 
Bit rate 


Music 


Speech over Music 


Speech between 
Music 


Speech 


14 kbps mono 










1 8 kbps stereo 










24 kbps stereo 










24 kbps mono 










32 kbps stereo 










48 kbps stereo 











More recent information on the performance of the codecs based on more recent versions of the codecs can be found in 
TR 26.936 [60]. 

Enhanced aacPlus decoder is also able to decode MPEG-4 AAC LC content. 

Extended AMR-WB decoder is also able to decode AMR-WB content. 

In addition, MPEG-4 AAC Low Complexity and MPEG-4 AAC Long Term Prediction object types [19] may be 
supported. The maximum sampling rate to be supported by the decoder is 48 kHz. The channel configurations to be 
supported are mono (1/0) and stereo (2/0). 
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4.4 Synthetic audio 



If synthetic audio is supported, the Scalable Polyphony MIDI (SP-MIDI) content format defined in Scalable Polyphony 
MIDI Specification [28] and the device requirements defined in Scalable Polyphony MIDI Device 5-to-24 Note Profile 
for 3GPP [29] should be supported. 

SP-MIDI content is delivered in the structure specified in Standard MIDI Files 1.0 [31], either in format or format 1. 

In addition the Mobile DLS instrument format defined in [38] and the Mobile XMF content format defined in [39] 
should be supported. 

A MMS client supporting Mobile DLS shall meet the minimum device requirements defined in [38] in section 1.3 and 
the requirements for the common part of the synthesizer voice as defined in [29] in sections 1.2.1.2. If Mobile DLS is 
supported, wavetables encoded with the G.711 A-law codec (wFormatTag value 0x0006, as defined in [38]) shall also 
be supported. The optional group of processing blocks as defined in [39] may be supported. Mobile DLS resources are 
delivered either in the file format defined in [38], or within Mobile XMF as defined in [39]. For Mobile DLS files 
delivered outside of Mobile XMF, the loading application should unload Mobile DLS instruments so that the sound 
bank required by the SP-MIDI profile [29] is not persistently altered by temporary loadings of Mobile DLS files. 

Content that pairs Mobile DLS and SP-MIDI resources is delivered in the structure specified in Mobile XMF [39] . As 
defined in [39], a Mobile XMF file shall contain one SP-MIDI SMF file and no more than one Mobile DLS file. MMS 
clients supporting Mobile XMF must not support any other resource types in the Mobile XMF file. Media handling 
behaviours for the SP-MIDI SMF and Mobile DLS resources contained within Mobile XMF are defined in [39]. 



4.5 Still Image 



If still images are supported, ISO/IEC JPEG [8] together with JFIF [9] shall be supported. The support for ISO/IEC 
JPEG only apply to the following two modes: 

- mandatory: baseline DCT, non-differential, Huffman coding, as defined in table B.l, symbol 'SOF0' in [8]; 

- optional: progressive DCT, non-differential, Huffman coding, as defined in table B.l, symbol 'SOF2' [8]. 

For JPEG baseline DCT, EXIF compressed image file format should also be supported, as defined in [54]. In that case 
there is no requirement for the MMS client to interpret or present the EXIF parameters recorded in the file. 

4.6 Bitmap graphics 

If bitmap graphics is supported, the following bitmap graphics formats should be supported: 

- GIF87a[15]; 

- GIF89a, [16]; 

- PNG, [17]. 

4.7 Video 

If video is supported, ITU-T Recommendation H.263 profile level 45 [10] [11] shall be supported. In addition: 

- H.263 Profile 3 Level 45 [10] [11]; 

- MPEG-4 Visual Simple Profile Level 0b [ 1 2] ; 

- H.264 (AVC) Baseline Profile Level lb [52] [53] with constraint_setl_flag=l; 

should be supported. There are no requirements on output timing conformance of H.264 (AVC) decoding (Annex C of 
[52]). 

A video buffer model defined in Annex G of document [14] should be used with H.263 and MPEG-4. It shall not be 
used with H.264 (AVC). 
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NOTE: ITU-T Recommendation H.263 profile has been mandated to ensure that video-enabled MMS supports 
a minimum baseline video capability. Both H.263 and MPEG-4 visual decoders can decode an H.263 
profile bitstream. It is strongly recommended, though, that an H.263 profile bitstream is transported 
and stored as H.263 and not as MPEG-4 visual (short header), as MPEG-4 visual is not mandated by 

MMS. 



4.8 Vector graphics 



If 2D vector graphics is supported, Scalable Vector Graphics (SVG) Tiny 1.2 [20] [21] and ECMAScript [55] shall be 
supported. 

NOTE 1: The compression format for SVG content is GZIP [35], in accordance with the SVG specification [20]. 

NOTE 2: Only media formats supported by MMS, as specified in clause 4 of this specification, shall be used. MMS 
clients do not support the Ogg Vorbis format. 

NOTE 3: Content creators of SVG Tiny 1.2 for MMS clients are strongly recommended to follow the content 
creation guidelines provided for PSS clients in Annex L of [14]. 

NOTE 4: If SVG Tiny 1.2 will not be published within a reasonable timeframe, the decision to adopt SVG Tiny 1.2 
in favour of SVG Tiny 1 . 1 may be reconsidered. 

4.9 File Format for video and associated speech/audio media 
types 

To ensure interoperability for the transport of video and associated speech/audio and timed text in an MM, the 3 GPP 
file format with Basic profile shall be supported. 

The usage of the 3 GPP file format shall follow the technical specifications and the implementation guidelines specified 
in TS 26.233 [33] 

NOTE: When using speech media type alone, AMR or AMR-WB data is stored according to the file format 
specified in [32] . 

4.1 Media synchronization and presentation format 

The 3GPP MMS uses a subset of SMIL 2.0 [24] for media synchronization and scene description. MMS clients and 
servers with support for media synchronization and scene descriptions shall support the 3 GPP SMIL Language Profile 
defined in [34] . 

This profile is a subset of the SMIL 2.0 Language Profile but a superset of the SMIL 2.0 Basic Language Profile. 
Document [34] also includes an informative annex A that provides guidelines for SMIL content authors. 

Additionally, XHTML Mobile Profile [30] for scene description should be supported. MMS clients and servers with 
support for scene descriptions based on XHTML shall support XHTML Mobile Profile [30], defined by the WAP 
Forum. 

XHTML Mobile Profile is a subset of XHTML 1.1 but a superset of XHTML Basic. 

4.11 Timed text 

If timed text is supported, MMS clients shall support [35] with 3GP files using Basic profile [33]. 

4.12 Digital Rights Management 

If Rights Management is supported, OMA Digital Rights Management (DRM) 1.0 [56] [57] [58] shall be supported. 
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4.13 PIM 

If Personal Data Interchange is supported this shall be done according to the OMA vObject Minimum Interoperability 
Profile [59]. 

4.14 Dynamic and Interactive Multimedia Scene 

If dynamic and interactive multimedia scene is supported, MMS clients and servers shall support 3GPP TS 26.142 [61]. 
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